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Abstract 



A recent paper of Melbourne & Stuart, A note on diffusion limits of chaotic 
skew product flows, Nonlinearity 24 (2011) 1361-1367, gives a rigorous proof 
of convergence of a fast-slow deterministic system to a stochastic differential 
equation with additive noise. In contrast to other approaches, the assumptions 
►^ | on the fast flow are very mild. 

C\) ■ In this paper, we extend this result from continuous time to discrete time. 

Moreover we show how to deal with one-dimensional multiplicative noise. This 

\^ \ raises the issue of how to interpret certain stochastic integrals; it is proved 

that the integrals are of Stratonovich type for continuous time and neither 
Stratonovich nor Ito for discrete time. 

^^ \ We also provide a rigorous derivation of super diffusive limit where the 

stochastic differential equation is driven by a stable Levy process. In the case 
of one-dimensional multiplicative noise, the stochastic integrals are of Marcus 

^ | type both in the discrete and continuous time contexts. 



Keywords homogenisation, deterministic maps and flows, multiplicative noise, 
Stratonovich/Ito/Marcus stochastic integrals 

1 Introduction 

There is considerable interest in understanding how stochastic behaviour emerges 
from deterministic systems, both in the mathematics and applications literature. A 
simple mechanism for emergent stochastic behaviour is via homogenization of multi- 
scale systems, see for example [27] . 
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Recently, Melbourne & Stuart [23J embarked on a programme to develop a rigorous 
theory of homogenization based on new ideas in the theory of dynamical systems. The 
aim is to avoid excessive mixing assumptions on the fast dynamics, since these are very 
difficult to establish even for uniformly hyperbolic (Axiom A or Anosov) flows (general 
references for the ergodic theory of uniformly hyperbolic maps and flows include [U |5j 
I3CT1 [32J). Instead, the theory relies only on relatively mild statistical properties that 
are known to hold very widely and are independent of mixing assumptions (see [23J 
or Remark 12.11 below for further details). 

The mechanism for emergent stochastic behaviour in deterministic systems, 
whereby fast chaotic dynamics induces white noise in the slow variables, is much- 
studied in the applied literature, see for example [2J [JTJ [TBI CEO EZJ- See also the 
program outlined by [T3] . The aim here, continuing and extending the work in [23] , is 
to obtain rigorous results for large classes of fast-slow systems under unusually mild 
assumptions. 

In particular, [23] studied fast-slow ODEs of the form 

i (£) = e-Vo(y (6) ) + /(x (e) ,y (e) ), s< 6 >(0)=£ 

y^ = e- 2 g(y^), y®{0) = r,, (1.1) 

where x (e) E R d , y^ E R 1 , e > 0. It is assumed that the vector fields f :R e ^ R d , 
f : R d x R e — > R d and g : R e — > R e satisfy certain regularity conditions and that 
the fast y dynamics possesses a compact attractor A C M with ergodic invariant 
probability measure fi satisfying certain mild chaoticity assumptions. Finally, it is 
required that j A /o dfi = so that we are in the situation of homogenization rather 
than averaging. The conclusion [23] is that x^ e ' — )- w X in C([0, oo), R d ) as e — > 0, 
where X is the solution to a stochastic differential equation (SDE) of the form 

dX = VzdW + F(X)dt, X(0)=£. (1.2) 

Here W is unit d- dimensional Brownian motion and F(x,y) = J A f(x,y)dfi(y). 
(Throughout, we use — > w to denote weak convergence in the sense of probability 
measures [3j.) 

In this paper, we consider the twin goals of (i) allowing multiplicative noise when 
d — 1, and (ii) proving analogous results for discrete time to those for flows. 

In the introduction, we will focus on the case of discrete time. First we consider 
the case where there is no multiplicative noise. Consider the equation 



x 



W(n + 1) = z (e) (n) + c/o(y(n)) + e 2 f(x^(n), y(n), e), x^(0) = £, (1.3) 



where x^ e '(n) E R d and the fast variables y(n) are generated by a map g : R e — > M. £ 
with compact attractor Act' and ergodic invariant measure \x. We require that A 
is mildly chaotic as in [23] . That is, we assume a weak invariance principle (WIP) and 
a large deviation principle (LDP). The precise definitions are recalled in Section |2J 



A consequence is that n~ x l 2 Y^jZo foivU)) converges in distribution as n — > oo to a 
<i-dimensional normal distribution with mean zero and d x d covariance matrix E. 

Define x^(t) = x^ e \te^ 2 ) for t = 0,e 2 ,2e 2 , . . . and linearly interpolate to obtain 
x® GC([0,oo),M d ). 

Our first main result is a direct analogue of the continuous time result of Mel- 
bourne & Stuart [231 Theorem 1.1]. 

Theorem 1.1 Consider equation (I1.3J) . Assume that f and f are locally Lipschitz 
in x and y, and that \im e ^ f(x, y, e) = f(x,y,0) uniformly on compact subsets of 
M. d x A. Assume that A satisfies the WIP and LDP as described below. Suppose that 
J A f dfi = and set F(x) = f A f(x, y, 0) d/j(y) . 

Suppose that solutions X to the SDE (jl.2p exist on [0, oo) with probability one. 
Then x® ^ w X in C([0, oo), R d ) ase^O. 

Remark 1.2 (a) In [23] it was assumed that / (and hence F) is globally Lipschitz 
and so the condition on global existence for X is automatic. In the course of the 
current paper, such global conditions become rather excessive, so we relax them from 
the outset, see Subsection 13. II 

(b) In the generality of this paper, including Theorem 11.11 it is not possible to write 
down a formula for the covariance matrix E. However in many situations, including 
the Young tower situation [33 ES] discussed in Section [21 it is possible to prove 
convergence of second moments [21] leading to the expression 

S^nmn- 1 f (]T f (y(j))) (]T / (</(j)))V (1-4) 

Ja j=0 j=0 

Under the additional assumption of summable decay of correlations (which is valid 
for Young towers that satisfy the WIP and are mixing), we obtain the well-known 
Green-Kubo formula 

p OO p OO p 

s= / /o/ T ^ + E / fo(y(n))fo(y(o)) T df, + J2 / fo(y(o))fo(y(n)) T dfi. (1.5) 

•^ n=1 J A n=1 -/A 

In particular, we note that fll.4p is valid for general uniformly hyperbolic maps 
and (11.51) is valid under the additional assumption that the map is mixing. 

(We have written formula (11.51) so that it applies equally to invertible 
and noninvertible maps. Of course in the case of invertible maps E = 

En^ooI A fo(y(n))fo(y(o)) T d^.) 

Next, we turn to the case of multiplicative noise in the discrete time setting with 
d — 1. Consider the equation 

x^(n + 1) = x ie \n) + eh{x {e \n))fo{y{n)) + e 2 f{x {e \n), y{n), e), (1.6) 



where x^(n) e R and the fast variables y(n) are generated as before. 

As e — > 0, we expect that x^ e '(t) = x^(te~ 2 ) converges weakly to solutions X of 
an SDE of the form 

dX = h(X) dW + F(X) dt, 

but there is the issue of how to interpret the stochastic integral J h(X) dW. In the 
continuous time setting, one expects the limiting SDE to be Stratonovich J33J El], 
and this is indeed the case, see Theorem 13.31 However, the discrete time case is very 
different, as shown by Givon & Kupferman [9]. They considered the special case where 
h(x) = x and f(x,y,e) = Xx (A constant), and showed that in general the stochastic 
integral is neither Ito nor Stratonovich except in the special case where y(n) is an iid 
sequence - in that case the integral is Ito. Their proof exploited linearity in a crucial 
way. Here we extend their results, relaxing linearity and allowing / to depend on y. 

Theorem 1.3 Let d = 1 and consider equation (11.61) . Assume that f and f are 
locally Lipschitz in x and y, and that lim € _j. f(%, y, e) = f(%, y, 0) uniformly on com- 
pact subsets o/lx A. Assume that h : R — > R is C l and nonvanishing. Assume that 
A satisfies the WIP and LDP as described below. Suppose that J A /o dfj, = and set 

F ( x ) = J A f( x iy^) d v(y)- 

Consider the Stratonovich SDE 
dX = ah(X) odW + (F(X) - -h{X)h\X) J fl dfij dt, X(0) = «£, (1.7) 

where W is unit 1-dimensional Brownian motion. Suppose that solutions X to the 
SDE exist on [0, oo) with probability one. Then x^ -^- w X in C([0, oo),R) as e — > 0. 

Remark 1.4 Note as in [9] that the correction term in this SDE is Ito if and only 
if a 2 = J /q dp,. This holds in the independent case but is generally false. (For 
example, the Green-Kubo formula (II. 5p specialised to the case d = 1 yields a 2 = 
f A f 2 dy, + 2 £~ 1 J A / O (y(n))/ O (y(0)) d»). 

See Section 6] and also [121 126] for discussions concerning corrections that are 
neither Ito not Stratonovich. 

The assumption that h is nonvanishing means that we can write h = 1/r' where 
r is a monotone differentiate function. By a change of variables, Z = r(X), it is 
then possible to reduce to the situation where there is no multiplicative noise, see 
Sections 13.21 and 14.21 In the process the drift term F(X) in the SDE is transformed 
into F(Z) where 

F = (F/h)o r -\ 



(see Proposition) 

These considerations lead to an extension of Theorem 11.31 where h is not required 
to be nonvanishing. Instead we require only that there is a monotone differentiable 
function r such that r' = 1/h and such that £ = X(0) lies in the domain of r. (It 



suffices that h(£) 7^ 0, in which case we can choose r(x) = ff l/h(y) dy for x near £.) 
Define F = {F/h) o r _1 whenever this expression makes sense. 

Proposition 1.5 Assume the same set up as in Theorem 11.31 except that h is not 
assumed to be nonvanishing. Consider the SDE 

dZ = adW + F(Z)dt, Z(0) = r(£). (1.8) 

Suppose that solutions Z to the SDE exist on [0, oo) with probability one. Then 
solutions X to the SDE ( II. 7p exist on [0, oo) with probability one, and x^ e ' —> w X in 
C([0,oo),R) as e->0. 

For example, suppose that h(x) = x and f(x,y) = xq(x,y), F(x) = xQ(x). If 
£ > 0, then we choose r(x) = logx yielding F(z) = Q{e z ) which is well-behaved in 
many situations (eg. Q{x) constant, Q(x) = —x p , p > 0). The case £ < is similar 
with r(x) = log(— x). 

Remark 1.6 (Higher-dimensional multiplicative noise) A more general set- 
ting that includes both situations described above is where the slow equations have 
the form 

x ^(n + 1) = x {e \n) + eh{x ie) {n))f {y{n)) + e 2 f(x {e \n),y(n), e), x {e) {0) = £, 

where x {e \n) E R d , f Q : A ->■ R d , f : R d x A x R -> M. d and h : R d ->• L(M d ,M d ). 
Theorem 11.11 deals with the case h(x) = Id, d general, and Theorem 11.31 covers the 
case d = 1, h general. It is well known that multiplicative noise presents more 
serious problems in higher dimensions, and it will be necessary to make assumptions 
beyond the WIP and LDP in general. This is the subject of future work. However, 
the methods in this paper generalise immediately to the higher-dimensional situation 
whenever h has the particular form h = (Dr)' 1 for some C 2 diffeomorphism r : R d — > 
R d . The formulas in the discrete case are straightforward to derive but unpleasant to 
write down and we omit them here. The analogous result in the flow case is described 
in Remark 13.51 

Finally we consider the case where the fast dynamics is not sufficiently chaotic to 
support the central limit theorem. In this case, we can still hope to prove homoge- 
nization theorems but the limiting SDE is driven by a stable Levy process. Again, 
we obtain results for ODES and maps, with additive noise in general dimensions and 
multiplicative noise for d — 1. The interpretation of the stochastic integral in the 
limiting SDE is the same for both continuous and discrete time, and is of Marcus 
type (see Section |5] for further details.) 

The remainder of this paper is organised as follows. In Section [21 we recall the 
setting of [23] as regards the chaoticity assumptions on the fast dynamics, though now 
in the context of discrete time. In Section |3l we consider fast-slow ODEs, relaxing the 



uniformity of the Lipschitz conditions in [23] and permitting multiplicative noise for 
d = 1 (with a restricted extension to higher dimensions) . In Section HJ we consider 
fast-slow maps and prove the results stated in this introduction. In Section [5j we 
state and prove results where the limiting SDE is driven by a stable Levy process. In 
Section [61 we present some numerical results. Section [7] is a conclusion section. 

2 Assumptions on the fast dynamics 

In [23], we made mild assumptions on the fast dynamics that are satisfied by large 
classes of dynamical systems. The formulation there is for continuous time. Here we 
discuss discrete time (both contexts are required in this paper). 

Let g : A — > A where A is a compact subset of M. e and \x is an ergodic invariant 
measure supported on A. Given y = r\ G A, we define the fast variables y(n), n > 0, 
by setting y(n + 1) = g(y(n)). 

Let /o : A — > R d be a Lipschitz observable of mean zero. Define W n (t) = 
n ~^ Y^j=o fo(y(j)) for t = 0, i, |, . . . and linearly interpolate to obtain a continu- 
ous function W n : [0, oo) — >■ M d . We assume the weak invariance principle (WIP), 
namely that W n -^ w yT,W in C([0, oo), IR d ) where W is unit d- dimensional Brownian 
motion and E is a d x d covariance matrix. 

Next, let / : R d x A ->■ R d be globally Lipschitz and define F : R d -> M d , 
F(x) = j A f(x,y) d/j,(y). We assume the following Zarge deviation principle (LDP): 

1 Ar_1 
^(b E /(*' VO')) - ^)| > a ) < Ka, N), 

3=0 

where b(a, N) is independent of x and b(a, N) — > as A^ — > oo for all a > 0. 

Remark 2.1 As discussed in [23l Remark 1.3(a)], the WIP and LDP hold for a large 
class of maps and flows. These include, but go far beyond, Axiom A diffeomorphisms 
and flows, Henon-like attractors and Lorenz attractors. Young [35l [36] introduced 
a class of nonuniformly hyperbolic maps with exponential and polynomial decay of 
correlations. For maps, the WIP holds when the correlations are summable and the 
LDP holds in all cases. For flows, it suffices that there is a Poincare map with these 
properties and then the WIP and LDP lift to these flows (irrespective of the mixing 
properties of the flow). Precise statements about the validity of the WIP and LDP 
can be found in |20l [2T1 [221 . 



These mild chaoticity assumptions are used throughout this paper in exactly the 
same way as in [23]. For example, in the situation of (11. IB . the WIP suffices when the 
slow coupling term f{x^\y^) is independent of y^ e \ (The argument can be found 
in [23] and is also present in the proof of Theorem ll.il in Section H~T1 ) The LDP is 
required to control the y( e > dependence in f(x^ e \y^), see Lemma [A. 21 
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3 Extensions of the results for flows 

In this section, we extend the results in [23] by (i) relaxing the global Lipschitz 
conditions and (ii) allowing multiplicative noise. 

3.1 Relaxing the global Lipschitz condition on / 

Consider the fast-slow system (II. ip . We suppose throughout that the fast equations 
possess a "mildly chaotic" compact attractor Acl' satisfying the WIP and LDP. 
It is natural to assume that / : A — > R d , f : R d x A — > R d and g : A — > A are 
locally Lipschitz to ensure existence and uniqueness of solutions to the various initial 
value problems arising above. Boundedness and uniformity of Lipschitz constants 
on A then follows from compactness. However, in [23] it is further assumed (mainly 
for simplicity) that / is bounded with a uniform Lipschitz constant on the whole of 
R d x A. 

In this subsection, we show that the result of [23] holds without the global Lipschitz 
condition provided solutions to the limiting SDE exist for all time with probability 
one. The formulation of |23j Theorem 1.1] is unchanged if in addition solutions to 
the fast-slow system exist for all time for /x-almost every r\. Otherwise, we require the 
following modification. Throughout we regard £ as fixed. 

Let [0, r e ] be the maximal interval of existence for a solution x^ and define 

X M(t) = P )(t) ' °^^/ 2 

* [) \*W(r 6 /2), t>r e /2 ■ 

(If x^' exists on [0, oo), then set x* = x^ e \) We say that x^ e ' converges weakly 
to X in C([0, oo),M d ) if r e — > oo in probability and x* converges weakly to X in 

C([0,oo),R d ). 

Theorem 3.1 Assume that the fast equation (with e = 1) has a mildly chaotic 
compact invariant set A with invariant ergodic probability measure fi. Suppose that 
/o : A — > R d and f : R d x A — > R d are locally Lipschitz, and that f. f d/j, = 0. Define 
F(x) = J A f(x, y) dn(y) and let (elf 

Let x( € \y( € ' denote the solutions to the fast- slow system (II. ip . Assume that solu- 
tions X to the SDE ( II. 2p exist on [0, oo) with probability one. Then x^ —} w X in 
C([0,oo),R d ) ase^O. 

In the remainder of this subsection, we prove Theorem 13.11 by reducing it to the 
situation in [23J . The ideas are standard, but care has to be taken since we are talking 
about weak convergence of solutions, and solutions to the fast-slow equation can blow 
up arbitrarily quickly. 

Let R > and define fa : R d x A — > R d to be a globally Lipschitz function that 
agrees with / on -Br(£) = {x G R d : |x — £| < R} (where \x — £| is Euclidean distance). 

7 



Let x^' R , y( e > denote solutions to (II. ip with / replaced by f R . These solutions exist 
for all time; in particular x^' R G C([0, oo), R d ). Similarly, let X R denote the solution 
to the SDE (11.21) with F replaced by F R where F R (x) = j A fn(x, y) d[i(y). (Note that 
E depends only on f and g and hence is independent of R.) By [23], x^' R — > w X R 
inC([0,oo),R d ) ase^O. 

Next, we define stopping times t r , t^ r G (0, oo) U {oo} where t r > is least 
such that X R G dB R (£) = {x G R d : \x — £| = R} and r e>R > is least such that 
x (e),n e dB R (£). By construction, X R = X on [0,Tr] and x (e),i? = x (e) on [0,r e , fl ]. 

Proposition 3.2 r e — >• oo, tr — > oo and r^ -^ oo in probability as R —$■ oo, e — >■ 0. 

Proof By assumption t r — > oo almost surely, and hence in probability as R — >■ oo. 
In other words, for any T > 0, 5 > 0, there exists i? such that P{t r > T) > 1 — 5. 

Next, for fixed R > 0, given t> G C([0, oo], R) we set ^(f) = sup{£ > : v{t) G 
B R (v(0))} G (0, oo) U {oo}. This defines a continuous map ip. Moreover, t^ r = 
ip(x^' R ) and t r = ip(X R ), so it follows from the continuous mapping theorem that 
r e ,R ->d r fl as e -)• 0. 

In particular, with T and i? as in the first paragraph, we have that there exists 
e > such that [i{t €)R > T) > P(t r > T) - 5 > 1 - 25 for all e G (0, e ). Altogether, 
we have shown that for any T > 0, 5 > 0, there exists R and eo such that h{t CjR > 
T) > 1 — 25 for all e G (0, e ), so r eR — > oo in probability. 

Since r t > r e>R for all R, it follows immediately that r e — > oo in probability. ■ 



Now fix T > 0. By Proposition 13.21 for any 5 > there exists eo and R such 
that /^(r £) fl > 2T) > 1 — 5 for all e G (0, eo). We choose i? so that in addition 
fi(r R >2T)>l-5. 

For this choice of R, let X? = X R and x^ = x^' R . Then we have defined 
families of random elements Xt and Xj, in C([0, oo),R d ) such that x^ — $ w Xt- 
Moreover, neglecting a set of measure 5, we have r e > r ejj R > 2T so r e /2 > T and 
hence x^ = £ = x* on [0,T]. Finally, neglecting a set of measure 5, we have 
Xt = X. Hence xt converges weakly to X in C([0,T),R d ). Since T is arbitrary, 
xl e) ^XinC([0,oo),R d ). 

3.2 Flows with multiplicative noise 

Next, we consider the case of multiplicative noise when d = 1. Consider the fast-slow 
system 

i« = e-^Cr^/oCyW) + /(^^W), X W( ) = £ 

y w = e"W e) ), y {e) (Q) = V, (3.1) 

with x (e) G R, y (e) G Rf 



Theorem 3.3 (a) Assume that g, fo, f, F, W and S are as in Theorem \3.1\ (but 
with the restriction that d = 1 and X is denoted by a 2 > 0). Suppose that h : R — > R 
is C l and nonvanishing. 

Let £ G R and consider the Stratonovich SDE 

dX = ah(X)odW + F(X)dt, X(0) = £. (3.2) 

Let x^ e \y^ e ' denote the solutions to the fast-slow system (13. ip . Assume that 
solutions X to this SDE exist on [0, oo) with probability one. Then x^ —} w X in 
C([0,oo),R) ase-^0. 

(b) More generally, assume the above set up but without the assumption that h is 
nonvanishing. Suppose that we can write h = 1/r' on an interval containing £. Write 
F = (F/h) o r _1 where defined. Suppose that solutions Z to the SDE ( 11.81) exist on 
[0, oo) with probability one. 

Then solutions X to the SDE (13.21) exist on [0, oo) with probability one, and 
i< e ' ^ w X in C([0, oo), R) as e ^ 0. 

Theorem 13.31 is proved by reducing it to Theorem 13.11 We require a preliminary 
elementary result. 

Proposition 3.4 Let r : R — > R be a C 2 diffeomorphism. Suppose that W is a 
one- dimensional unit Brownian motion and a 2 > 0. Consider the Stratonovich SDE 

dX = a{r'(X))- 1 odW + F(X)dt J X(0) = £. 

Then X is a solution to this SDE if and only if Z = r(X) satisfies the SDE 

dZ = adW + F(Z)dt 1 Z(0)=r(O, 

where F = (r'F) o r -1 . 

Proof Suppose that X satisfies the first SDE. Since the Stratonovich integral satisfies 
the usual chain rule, Z = r(X) satisfies 

dZ = r'{X) odX = adW + r'(X)F(X) dt = o dW + F{Z) dt. 

The converse direction is identical. ■ 

Proof of Theorem 13.31 Write h = 1/r' and let Z = r(X) where X satisfies the 
SDE ( 13. 2p . The assumptions on h guarantee that r is a C 2 diffeomorphism. By 
Proposition E31 Z satisfies the SDE ( Q|l . 

Next, let z^ e > = r o x^ e \ The x^ e > equation in (13. ip becomes 

z^\t) = e-VoCw^C*)) + f(z^(t),y^(t)), (3.3) 



where 

f(z,y) = f(r~ 1 z,y)/h(r~ 1 z). 

Since F(z) = j A f(z, y) dfi(y) and / is locally Lipschitz, we are now in the situation 
of Theorem 13.14 and it follows that solutions of (13. 3p converge weakly to solutions 
of fll.8p . That is, r o x^ e > — > w r o X. Applying the C 1 map r _1 , it follows from the 
continuous mapping theorem that x® — »•«, X as required. ■ 

Remark 3.5 As mentioned in the introduction, this result has a restricted exten- 
sion to higher dimensions. Consider the fast-slow equations (13. ip with x^ E K d 
and suppose that h : M. d — > L(R d ,IR d ) can be written as h = (Dr)~ l for some C 2 
diffeomorphism r : W. d — > IR d . Then the substitution z^ = r(x^) yields the equa- 
tion (13.31) with f(z,y) = h(r~ 1 z)~ l f(r~ 1 z,y). Again it follows from Theorem 13.11 
that z& ^ w Z where dZ = \/tdW + F(Z) dt, F(z) = h(r- 1 z)~ 1 F{r- 1 z). The 
change of variables formulas for Stratonovich SDEs shows that X = r~ x {Z) satis- 
fies dX = yTih(X) o dW + F(X) dt. Again x^ e ' —t w X by the continuous mapping 
theorem. 

4 Proofs of the results for maps 

In this section we prove the discrete time homogenization results stated in the intro- 
duction. By the argument in the proof of Theorem 13.11 we may suppose from the 
outset that all relevant Lipschitz constants are uniform on the whole of M. d and that 
lim e ^o f(x, V-, e) — f{x, V, 0) uniformly on the whole of M d x A. 

Throughout, it is more convenient to work with the piecewise constant function 
x( e \t) = x^([te~ 2 ]) rather than with the linearly interpolated function x^(t). (Here 
[te~ 2 ] denotes the integer part of te~ 2 .) Since the process x^ e '(t) = x^ e \[te~ 2 ]) is 
not continuous, we can no longer work within C([0, cxo),M d ). Instead we prove weak 
convergence in the space D([0, oo), R d ) of cadlag functions (right-continuous functions 
with left-hand limits, see for example [3J Chapter 3]) with the supremum norm. 

It is clear that sup t6 r 0iT i \x^(t) — x^ e '(t)\ — > as e — > 0. Hence weak convergence of 
x^ in -D([0, oo),R d ) is equivalent to weak convergence of x^ in D([0, oo),R d ). This 
in turn is equivalent to weak convergence of x^ in C([0, oo), R d ) (see for example the 
last line of p. 124 of 0). 

4.1 Proof of Theorem 11.11 

Write d(e) = sup a . eR d jyeA \f(x, y, e) — f(x, y, 0)|, so lim e ^ d(e) = 0. First note that 

n— 1 n— 1 

x^(n)=e + e^/o(y(j)) + e 2 ^/(^(j),l/(j),e). 

j=0 j=0 
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Hence 

[te- 2 ]-l 



^)(t)=t. + W M(t) + e 2 J2 f(x {e \j)MJ),Q) + Kl e \t) 



[t6- 2 ]-l 

^ + ^(i)+4 ) (t) + 4 ) (t) + e 2 £ F(x (e) (e 2 j)). 

i=o 

where W*>(*) = eE^ 1 " 1 /o(l/0')), I^WI < r ^0), and 

[te- 2 ]-l 

i^ e> (t) = e a £ (/(xW(j),2/(j),0)-F(xW(j))). 
i=o 

For £ an integer multiple of e 2 , the term e 2 E'l ^(^ ( e2 i)) is the Riemann sum 
of a piecewise constant function and is precisely L F(x^(s)) ds. For general t, 

[te" 2 ]-l 
2 



e 

3=0 



where Kg 6) (i) < e 2 ]/]^. Altogether, 

£(«)(*) = £ + H/ (e) (t) + f^(£) + f F(5 (e) (s)) ds, 

where K® = K[ e) + K { 2 e) + K { 3 e) . 

By exactly the same argument as in [23], K% — > in L 1 (D([0,T],IR d ); //). (For 
convenience the proof is reproduced in Appendix [A]) It follows that W^ + K^ — > 
V^W in D([0,T\,R d ). Now consider the continuous map Q : D([0,T\,R d ) -> 
£>([0,T],R d ) given by 0(u) = u where v(t) = £ + u(t) + f*F(v(s))ds. Then 
i (e) = £(W/ (e) + K&), so x (e) ^ w </(VEW) = X. This completes the proof of 
Theorem II. 11 



4.2 Proof of Theorem 11.31 and Proposition 11.51 

Again, we reduce to the case where F is globally Lipschitz and uniformly continuous 
at e = 0. Also, we may suppose that r" is uniformly continuous. 

Write Z = r(X) where X satisfies the SDE (\1.7\i . By Proposition 13.41 Z satisfies 

dZ = odW + F(Z) dt, (4.1) 
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where 

F(z) = r\r- l z)F{r' 1 z) - -ti{r~ l z) I f 2 d\t 

= r'(r- l z)F(r- l z) + V y '{r' 1 z)[r' '{r' 1 z)}~ 2 f f 2 dfM. 

Define z^(n) = r(x^ e '(n)). Using Taylor's theorem to expand the C 2 map r, we 
obtain 

z {e \n + 1) - z {e \n) = r(x {e \n + 1)) - r(x {e \n)) 

= r'(x (t \n))(x {e) {n + 1) - x (e \n)) (4.2) 

+ \r'\x ie \n))(x^(n + 1) - x {e \n)f + o((x^(n + 1) - x^(n)) 2 ), 

where the last term is uniformly o(e 2 ) since r" is uniformly continuous. 

Substituting for x^ e '(n + 1) — x^ e '(n) using equation (ll.6p . equation (j4.2p becomes 

^ (£) (n + 1) - z (e) (n) = ef (y(n)) + e 2 {r'(a; (e) (n))/(a; (e) (n),y(n), 0) 



+ ^(a;W(n))[r / (xW(n))]- 2 /o(y(n)) a + o(l) 
= e/ (y(n))H-e 2 /(^(n),y(ri),e), 

where 

/>, y, e) = /(r-^J/Cr-^, y, 0) + ^'(r" 1 *) [r' '{r^ z)Y 2 f (y) 2 + o(l), 

uniformly in z,?/ as e -> 0. Since F(z) = f A f(z,y,0)d/j,(y), it follows from Theo- 
rem [1J] that z^{t) = z^([te^ 2 ]) converges weakly to solutions Z of the SDE (j4.ip . 
Applying the C 1 map r _1 , it follows from the continuous mapping theorem that 
x^ — > w X (and hence x^ — > w X) as required. 

5 SDEs driven by stable Levy processes 

In this section, we consider the situation where the fast dynamics is not sufficiently 
chaotic to support the WIP. With reference to Remark 12.11 this occurs when the 
Young tower [HE] modelling the map (or the Poincare map in the case of flows) has 
nonsummable decay of correlations. In this case, the LDP still holds as before [T$ll2"%] 
but weak convergence to Brownian motion fails. However there are instances where 
instead there is convergence to a stable Levy process. 
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The prototypical examples are provided by Pomeau-Manneville intermittency 
maps [2H]. For definiteness, consider the maps g : [0, 1] — » [0, 1] studied by 



Jy(l + 2V), ye[0,J) 

[2y-l, ye[i,l] 

For 7 e (0, 1), there exists a unique absolutely continuous invariant ergodic probabil- 
ity fj,, and correlations decay at the rate n~^ ~ x \ The attractor is A = [0, 1]. 

In particular, correlations are summable if and only if 7 < |, and in this situation 
all of the results in the previous sections apply. From now on, we suppose that 
7 G (|, 1). Suppose that /o : A — > M. d is Lipschitz with J.f d/J, = 0, and assume 
further that /o(0) 7^ 0. Then Gouezel (TO] proved that the central limit theorem fails 
and instead that n -7 X^j=o foil/U)) converges in distribution to a stable law Y of 
exponent I/7. (More precisely, it follows from [10] that if c G M d and c • /o(0) 7^ 0, 
then n~ 7 5^?=o c ' fo(y(j)) converges in distribution to a 1-dimensional stable law of 
exponent I/7. Hence there is convergence in distribution to a d- dimensional random 
variable Y and c ■ Y is stable of exponent I/7 for all c. By [3TI Theorem 2.1.5(a) or 
(c)], Y is a (i-dimensional stable distribution, and its exponent is I/7 by [31] Theorem 
2.1.2].) 

Let G = Cri/ 7 denote the corresponding stable Levy process (independent and 
stationary increments with G(t) =d VY for each t and sample paths lying in 
D([0, 00), R d )). Then it follows from [25] that W n (t) = n^ ^S) * /o(l/0')) converges 
weakly to G in D([0, 00), R d ) with the Skorokhod .Mi topology. 

We proceed to consider fast-slow systems where the fast dynamics satisfies the 
LDP and weak convergence to a stable Levy process of exponent I/7. First, consider 
the fast-slow ODE 

±M = e n(a;W)/o(l/ (t) ) + /(x (e) ,?/ (e) ), xW(0) = £ 
y (e) = e-W e) )> Z/ (£) (0) = ^. 

If /i(x) = 1, then exactly the same arguments as before yield that x^ e ' —} w X 
where X satisfies the SDE 

dX = dG + F(X)dt, X(0)=£. 

If h = 1/r' is nontrivial and d = 1, then the same argument as before shows that 
x^ e ' —t w X where X = r(Z) and Z is the solution of the SDE 

dZ = dG + F(Z) dt, Z(0) = f , 

where JF = (F/h) or 1 . Transforming back, it is immediate that X satisfies the SDE 

dX = h(X)odG + F(X)dt, X(0) = £, (5.2) 
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provided that the stochastic integral h(X) odG satisfies the usual chain rule. For Levy 
processes, it turns out that neither the Ito nor Stratonovich interpretation is suitable, 
and the correct integral is due to Marcus [15] (see also [13]). See (U p. 272] for a 
discussion of the Marcus stochastic integral and in particular for the chain rule jl] 
Theorem 4.4.28]. 

For maps, we consider 

x ^(n + 1) = x {e) {n) + e 7 /i(x (e) H)/ (2/H) + ef{x^{n), y(n)), 

Set x^(t) = x^ ([te -1 ]) . If h = 1, we obtain again that x^ e ' converges weakly to 
solutions X of the SDE 

dX = dG + F(X) dt, X(0) = f . 

If h is nontrivial and d = 1, then we again obtain the Marcus SDE (15.21) . (Note 
that the second order expansion of r yields terms of order 27 which are negligible 
since 7 > 5. Hence the additional correction terms that arose in the discrete case for 
Brownian noise are absent for Levy noise.) 



6 Numerical validation 

In this section, we illustrate Theorem 11.31 with a numerical simulation of a suitable 
fast-slow map. For the fast dynamics we could consider a Pomeau-Manneville inter- 
mittent map as in (15. lft . with 7 G [0, |) so that the WIP and LDP are satisfied. In 
order to satisfy the centering condition J.fodfi = 0, it is more convenient to work 
with the following modified version of the map in (15. lft : 

'2/(1 + 2V), z/e[0,i) 

g(y)--{l-2y, </G[±,l] . (6.1) 

-g(-y), ye [-1,0) 

Again, for 7 G [0, 1) there exists a unique absolutely continuous invariant ergodic 
probability measure /x, Moreover, the WIP and LDP again hold for 7 G [0, |); we 
choose 7 = 0.1. The attractor is A = [— 1, 1]. 

Since the map g : A — > A is odd, the probability measure \x is symmetric around 
the origin. Hence the condition J A fodfi = is automatically satisfied provided 
/o : A — > R is odd. We choose 

fo(y)=y, h(x)=x*, f(x 1 y ) e) = \(\-x)y 2 , 

so the slow dynamics is given by 

x®(n + 1) = x (e) (n) + ex {e) {ri)5y{n) + e 2 ±(§ - x {e \n))y(n) 2 . (6.2) 
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According to Theorem II .3[ rescaled solutions x^ e '(t) = x^(te 2 ) of (16.21) converge 
weakly to solutions of the SDE 

dX = aX^ odW + -(- -X\ I y 2 dfidt- - / y 2 d\x dt 

= aX^ dW + -(--X\ y 2 dfidt+- (a 2 - / y 2 d/i) dt, 

= aX^ dW + a(P - X) dt, (6.3) 

where W is unit 1-dimensional Brownian motion and 

a 2 = J A y(0) 2 dix + 2 ZZi L v(n)y(0) dp = lim^ n" 1 / A (££[ vU)? dp, 
a = lJ A y 2 dfx, (3 = \{l + a 2 /a). 

This is the Cox-Ingersoll-Ross (CIR) model [HI [7] which has the closed form solution 

X(t) = c(t)H(t), c(t) = ^(l-e- at ), (6.4) 

where H(t) is a noncentral x _sc l uare d distribution with 4a/3/a 2 degrees of freedom 
and noncentrality parameter c(t)~ 1 e~ at !;. 

A long time iteration of the map (I6.ip . taking an ensemble average, yields the 
approximate values a 2 = 0.085 and j A y 2 d\x = 0.319, and hence a = 0.160, /3 = 0.383. 

A consequence of weak convergence is convergence in distribution for each fixed 
t > 0, namely that for any a G R, 

lim/i(x (e) (t) < a) = P(X(t) < a). 

We proceed to verify this result numerically with t = 10 and the initial condition 
£ = X M(0) =A(0) = 1. 

Computing the probability density function for x^(10) from the numerical sim- 
ulation of the full fast-slow system, and the limiting probability density function for 
X(10) using the closed-form solution (16.41) . we obtain the results shown in Figure [TJ 
We used ensembles consisting of 5, 000, 000 realizations (though for the fast-slow sys- 
tem we found that 100, 000 realizations were ample). 



In particular, Figure [TJ confirms our prediction regarding the drift term 



1(3 

2V4 



X) J A y 2 dfi + j (a 2 — J A y 2 dfi) in the limiting SDE (16. 3 j) . Figure |2] shows a comparison 
of this probability density function with those that would result from having the 
incorrect drift terms |(| — X) J A y 2 d\x or |(| — X) J A y 2 d\i + \a 2 that arise when the 
limiting SDE is interpreted as being Ito (as in the iid case) or Stratonovich (as in the 
continuous time case). 

We also present numerical evidence for convergence of first moments E(|£( e )(£)|). 
This is not a direct consequence of Theorem 11.31 However, for each t > 0, Theo- 
rem 11.31 together with boundedness (as e varies) of a higher moment implies (eg. [HI 
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Figure 1: Probability density function (empirical measure) at t = 10 for the fast-slow 
map (EH), (E2D with e = 0.8, 0.4, 0.2 (dashed, red) and for the SDE limit (JOJ (solid, 
blue). We used ensembles consisting of 5, 000, 000 realizations. 
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Figure 2: Probability density function (empirical measure) at t = 10 for the limiting 
SDE with different drift terms corresponding to three different interpretations of the 
stochastic integral: the theoretically predicted drift term in (16. 3p (left, blue) and 
those for the Ito interpretation (middle, red) and the Stratonovich interpretation 
(right, green). We used ensembles consisting of 5, 000, 000 realizations. 



Exercise 2.5, p. 86] that E(|:r( e )(£)|) converges, as e — > 0, to 

E(X{t)) =£e- at + f3(l-e~ at ). 



(6.5) 



We verified numerically that W,((x^(t)) 2 ) is convergent and hence bounded, implying 
convergence of E(|x^^(t)|) as demonstrated in Figure El 

7 Conclusions 

The paper [23] set out a programme for a rigorous investigation of homogenisation for 
fast-slow deterministic systems under very mild assumptions on the fast dynamics. 
In this paper, we have extended these results in a number of ways: 

1. Extension from continuous time to discrete time. 

2. Incorporation of one-dimensional multiplicative noise. 

3. Inclusion of situations where the SDE is driven by a stable Levy process. 

4. Relaxation of regularity assumptions, in particular the requirement in |23j that 
certain vector fields are uniformly Lipschitz. 
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Figure 3: First moment < t < 

0.8, 0.4, 0.2 (dashed, red) and for the SDE limit 



15 for the fast-slow map (16. II) . ( 16.21) with e = 
3) (solid, blue). We used ensembles 



consisting of 100, 000 realizations for the fast-slow map. 



Items 2-4 require interpretation of certain stochastic integrals. In the case of 
multiplicative noise in continuous time, the stochastic integrals are of Stratonovich 
type as would be expected. For discrete time, it was pointed out in [9] that the 
integrals are neither Ito nor Stratonovich. We recover their result in a much more 
general context. For multiplicative noise in the situation of item 3, where the SDE is 
Levy, we obtain Marcus stochastic integrals for both discrete and continuous time. 

Important directions of future research include the analysis of higher-dimensional 
multiplicative noise and of fully coupled systems where the fast dynamics depends on 
the slow variables. 

A Large deviation calculation 

We describe the implications of the LDP that are required in this paper. The proofs 
are identical to those in the continuous time context in [23] and are included for 
completeness. 

Proposition A.l For any a > 0, N > 1, n > ; x 6 M d , 

(n+l)iV-l 

E|- Y, f&yU)) ~ Hz)\<a + 2\f\ 00 b(a,N), 

j=nN 



Proof The proof is the same as the one in [23] for continuous time and is included 
for completeness. Let S(N,x) = \jjJ2j=o f( x ^y(j)) ~ F( x )\- Then by the LDP, 



ES(N,x)= / S(N,x)dfi + / S(N,x)dn 

JS{N,x)<a JS(N,x)>a 

<a+ |5U6(o, N) < a + 21/1006(0, N). 

This proves the result for n = and the general case follows from invariance of ft. I 
Let .fQ be the expression appearing in the proof of Theorem II .11 in Subsection 14. II 

Lemma A. 2 K^ ] ^0 m L\D([0, T},R d ); fj). 

Proof Define Q(x,y) = f(x,y, 0) —F(x) and note that \Q\00 < 2\f\oo and Lip(Q) < 
2Lip(/). Then K { 2 e \t) = f? , £ < j< [u-*lQfr {e) ti),vU))' Let N = W S ) and write 
K^\t) = K$\N8) + h where I Q = e 2 E^-^te- Q(^ e) U),yU))- We have 

|/o|<(<-JV«J)|g| 00 <2|/| 00 «J. (A.l) 



19 



We now estimate K\ (NS) as follows: 

N-l 

n=0 nSe- 2 <j<{n+l)5e- 2 

N-l 

= ^ E E (Q(x ie) (j)MJ)) - Q(x { '\n5e- 2 ),y(j))) 

n=0 n6e- 2 <j<(n+l)6e- 2 
N-l 

+ e 'E E Q(x (e \n6e- 2 ),y(j)) 

n=0 n6e- 2 <j<(n+l)Se- 2 
= h+ h- 

¥om5e- 2 <j< (n+l)5e- 2 , we have \x^(j)-x^(nSe~ 2 )\ < (e|/o|oc+e 2 |/U)<$e- 2 . 
Hence 

|/i| < iV5Lip(g)(|/ |oo + \fUSe- 1 < 2Lip(/)(|/oU + IfUTSe' 1 . (A.2) 

Next, 

N-l N-l 

J 2 = e ' E E (f(x^\n6e- 2 ),y(j),0) - F(x^(n6e- 2 )) = 6 £ J n , 

n=0 nSe- 2 <j<{n+l)5e- 2 n=0 

where 

Jn = e 2 6~ 1 Yl f(x^(nSe- 2 ),y(j),0) - F{x^\n5e- 2 )). 

n5e- 2 <j<(n+l)8e- 2 

Let a > 0. By Proposition |AJJ E|J n | < a + 2\f\ oc b(a, 5e~ 2 ). Recalling that 
iV = [t/S] we have that 

[T/S]-l 

sup \I 2 {t)\ <5 V \J n \. 
*e[o,T] n=0 

It follows that 

E sup \I 2 (t)\<T(a + 2\f\ 00 b(a } 5t- 2 )). (A.3) 

i€[0,T] 

Finally, we set 5 = es. By (jA.ip and (IA.2[) . lim e ^. Esup tG r 0)T i \Io(t)\ = 
lim e _>.oIEsup te [ 0)T ] \Ii(t)\ = 0. Moreover, by (1A.3J) and the assumption on b in the 
LDP, we have 

limsupE sup |/ 2 (t)| < Ta, 

£-►0 t£[0,T] 

and, since a > is arbitrary, 

limE sup \I 2 {t)\ = 0. 
e ^° te[o,T] 

Altogether, lim e _>. Esup tg [ 0)T ] |iQ (t)\ = proving the claim. ■ 
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